Picture for Bin Chen

Bin Chen

OctoT2I: A Self-Evolving Agentic Text-to-Image Router

Add code
Jun 01, 2026
Viaarxiv icon

CAPF: Guiding Search-Agent Rollouts with Credit-Attenuated Privileged Feedback

Add code
Jun 01, 2026
Viaarxiv icon

Pocket-Dentist: On-Device Dental Image Understanding via Efficient Multimodal Large Language Models

Add code
May 28, 2026
Viaarxiv icon

Reasoning Matters: Mitigate Hallucination in Multimodal Large Reasoning Models via Reasoning-Conditioned Preference Optimization

Add code
May 27, 2026
Viaarxiv icon

CVSearch: Empowering Multimodal LLMs with Cognitive Visual Search for High-Resolution Image Perception

Add code
May 22, 2026
Viaarxiv icon

Molecular Lead Optimization via Agentic Tool Planning

Add code
May 21, 2026
Viaarxiv icon

FlowErase-RL: Rethinking Concept Erasure as Reward Optimization in Flow Matching Models

Add code
May 19, 2026
Viaarxiv icon

Prompt2Fingerprint: Plug-and-Play LLM Fingerprinting via Text-to-Weight Generation

Add code
May 19, 2026
Viaarxiv icon

CPC-VAR:Continual Personalized and Compositional Generation in Visual Autoregressive Models

Add code
May 19, 2026
Viaarxiv icon

Mistletoe: Stealthy Acceleration-Collapse Attacks on Speculative Decoding

Add code
May 13, 2026
Viaarxiv icon